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Abstract. We study the problem of discriminating between non-orthogonal quantum states with least probability of error. 
We demonstrate that this problem can be simplified if we solve for the error itself rather than solving directly for the optimal 
measurement. This method enables us to derive solutions directly and thus make definite statements about the uniqueness of 
an optimal strategy. This approach immediately leads us to a state-discrimination analogue of Davies Theorem [1]. 
In the course of this, a complete solution for distinguishing equally likely pure qubit states is presented. 



One cannot perfectly discriminate between non-orthogonal quantum states. It is therefore sensible to ask how one 
can best discriminate such states. 

We consider the situation where we know that a system is in one of a pre-determined set of states {p,}, each of 
which occurs with probability pj, but we do not know which one. Such a situation might arise if we knew the properties 
of the preparation device (or communications protocol), but not the actual setting used on this specific preparation. 
We can make any possible measurement to identify the state, but we must make an identification of the state based on 
the result of the measurement. If the states {p ; } are not mutually orthogonal, then there will be a non-zero probability 
that this identification will be wrong. This is the state discrimination problem. 

In this situation it makes sense to define the optimal measurement strategy to be the one which minimises the 
probability P e of incorrectly identifying the state. This is the most well known example of a hypothesis testing problem, 
as detailed in [2, 3, 4]. 

We describe our measurement strategy with a Probability Operator Measure (POM). This is a set of operators {Ilk] 
which gives the probabilities of each possible measurement outcome P(k\j): 

P(k\j)=Tr(U k pj) (1) 

when the system is in the state pj. The elements (n^) of the POM represent probabilities and therefore must be subject 
to the following conditions: 

Non - negativity : tl k = fl^ > V k (2) 

and 

Completeness '-^tlk = 1- (3) 

k 

Each POM element (ft^) corresponds to the detection of the state Pk- Thus there must be exactly as many POM 
elements as there are possible states, though some of these POM elements may be zero operators corresponding to 
states which are never detected. 



MINIMISING THE ERROR PROBABILITY 

For a measurement strategy to minimise the error probability P e , it must satisfy a known set of necessary and sufficient 
conditions [2, 3, 4]: 

Y,PkPk&k-PjP} - > V 7. (4) 

k 

It is possible to derive [2, 4] a useful necessary condition on the minimum error strategy: 

(C- Pk pk)flk = OVk (5) 



where 

C = I>,pA, (6) 

i 

from the necessary and sufficient conditions (4) and the completeness condition (3). This necessary condition is 
equivalent to stating that the first derivative of the error probability is zero [2, 3]. 

While these conditions do give us a starting point for finding minimum error POMs, they do not themselves provide 
a great insight into either the form of minimum error measurement strategies, or into how error probability depends on 
the set of possible states. For this we must examine the solutions to these conditions, and there are not many solutions. 

The solved cases of the necessary and sufficient conditions can be categorised: 

1. When there are only two possible states [2] 

2. When the states possess some simplifying symmetry [4, 5, 6, 7, 8] 

3. When we have states for which the error cannot be reduced by measurement [9]. 

Only the first of these solutions is directly derived from the conditions (4), the others were all postulated and then 
shown to satisfy these conditions. Simply postulating a solution does not tell us whether that solution is unique. This 
lack of uniqueness does not aid our goal of understanding the irreducible error of measurements on non-orthogonal 
quantum states. 

The essential difficulty in solving the conditions directly (rather than postulating solutions) is that all of the variables 
(tl k ) appear in each condition, and they are not independent variables. 1 However, the operator C (in (5) and (6)) fixes 
the error probability P e as P e = 1 — Tr(C). This is a clue as to how we might solve the necessary and sufficient 
conditions: we will solve them for C rather than for {ft^.}. 2 

We can see that the necessary condition (5), combined with the POM conditions (3), implies the equality (6) 
originally used as the definition of C. Thus we can use these conditions as an alternative definition of C. Furthermore, 
since Tr(C) fixes the error probability, all strategies {tl k } which satisfy the necessary condition (5) for the optimal C 
will be optimal strategies. 

We can now restate the problem as finding an operator C such that: 

{C-p k p k )tl k = 0\/k, (7) 

and, from (4), 

C-Pkh>0Vk, (8) 

for some POM {f^}. Here C is our operator variable, and the optimal POM is derived from C by using (7), (2) and 
(3). 

From this we can immediately see that the optimal strategy might not be unique. The relation between the error (C) 
and the measurement {tl k } is fixed by (7), but (7) places no restriction on {T^rX.)}. The only restriction on these 
traces are the POM conditions (2) and (3), and these conditions cannot uniquely determine all of the {Tr(ITt)} when 
the number of POM elements is large. 

For the purpose of actually finding the solutions, this formulation suggests a two-step procedure: 

1 . Find operators C which can satisfy (7) and (8), then 

2. Check which of these operators leads to elements which can form a POM. 

An important restriction on C for step (1) is that, from (7), we must have either 

Det (C - p k p k ) = or tl k = V k. (9) 

Thus, for each outcome which occurs with non-zero probability, the corresponding C — p k p k must have a zero 
eigenvalue. For discriminating between sets of qubit states this approach becomes especially useful, since then 
C — p k p k and must be proportional to pure state projectors. 

We will demonstrate a simple example of how one use this method to find not an optimal strategy, but all optimal 
strategies for a given class of states. 



This is also why the two states case is more easily soluble: there is only one independent variable. 
2 This is rather reminiscent of the method used in [4] to prove the necessity of the conditions (4). 



EXAMPLE: DISTINGUISHING EQUIPROBABLE PURE QUBIT STATES 



Let us consider the problem of discriminating between a set of N equally likely pure qubit states. We seek to find all 
solutions to the necessary and sufficient conditions (4) for such sets of states. We use the following method: 

We assume that we do not need zero operators as POM elements (see (9)), 
We solve (9) for C, 

We obtain the directions of the POM elements from (7), 
We solve (3) for Tr(ft*). 

If no solution is found, we know that at least one POM element must be a zero operator. We then update our assumption 
accordingly, and try again. We shall begin by looking at a set of three states. 

The most useful basis to describe the three states in is the basis where the diagonal elements of their density operators 
are identical. In this basis the three states share a common latitude of the Bloch Sphere (see figure 1). If we then follow 
the procedure outlined above, we find that (9) is insufficient to completely define the optimal C, but the diagonal part 
of (3) fixes the rest. 
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FIGURE 1. The position on the Bloch sphere of the projectors proportional to the optimal POM elements (dashed arrows) for 
discriminating between a set of three equiprobable pure qubit states (solid arrows). These elements are equatorial in the basis where 
the states share a common latitude and each element has the same longitude as the corresponding state. These elements form a 
POM if and only if every semicircle of the circle of common latitude contains at least one state. 

The solution for three states (see figure 1) is similar to that for the symmetric states [5] in that the optimal POM 
elements are at the Bloch Sphere longitudes of the states and equatorial in latitude, providing that such elements can 
satisfy (3) and so form a POM. C is diagonal in the basis of figure 1, but it does not correspond to the square -root 
strategy [5, 6, 7] unless the three states actually are symmetric states as defined by [5]. 

It is interesting to note that the minimum error in this case depends only on the common latitude of the three states 
and not on their arrangement on that latitude: 

min(P e ) = 1- Pj - 2 P j^(+\p J \+}(-\pj\~), (10) 

which is the same for any of the pjs. It clearly does not depend on the off-diagonal elements of the states' density 
operators. The error is reduced the closer the common latitude of states is to the equator of the Bloch sphere. 

If the elements shown in figure 1 cannot form a POM (i.e. they are all in one half of the Bloch sphere), then the 
optimal strategy is the binary decision strategy [2] which best discriminates between the pair of states with least 
overlap. 

Should we add a fourth state to this picture, we run into some interesting problems. Those properties of C which 
were determined by (9) for the original three states cannot change, as those conditions must still hold. If the fourth (9) 
is not consistent with this, it can only lead to the situation where no C can satisfy both (9) and (3) for all states, and 
some zero operator POM elements must be introduced. The three possibilities for adding a fourth state are: 

1. The new state is on the common latitude of the three original states. Then the optimal C is the same as it was for 
three states. 



2. The new state is in the opposite Bloch Sphere hemisphere from all three states. Then Yuen's solution [4] for states 
which themselves form a POM (when multiplied by suitable coefficients) applies and C« 1. 

3. The new state shares a hemisphere but not a latitude with all three original states. Then no solution exists which 
does not contain at least one zero operator POM element. The optimal strategy will have non-zero elements 
corresponding to the subset of three states whose common latitude is closest to the equator of the Bloch Sphere. 

Further additional states follow the same pattern. The subset of states which the optimal strategy detects will always 
either have common diagonal elements in some basis, or be able to form a POM when multiplied by suitable 
coefficients, or consist of only two states. 



CONCLUSIONS 

We can simplify the problem of finding the best strategy for discriminating between non-orthogonal quantum states 
by solving the necessary and sufficient conditions for minimum error (4) for the error itself rather than solving directly 
for the optimal strategy. This is easier since then we have only one operator variable (C) and there is no issue with 
variables not being independent. 

When you do this is becomes obvious (we have N equations for only one unknown operator) that, in general, some 
states will never be selected because the corresponding POM element will have to be a zero operator. The exceptions 
to this are when the number of states is small, or some limiting symmetry applies to all of them. It is interesting to 
note here that the subset of states which corresponded to non-zero POM elements in the example had to be one of the 
special symmetry cases. 

The example gives the form of the solution for the optimal discrimination between all possible sets of equally likely 
pure qubit states. This example has also shown that the minimum error obtained in the symmetric states case [5] 
is more generally applicable. The corresponding solution described here applies to any arrangement of states with 
common diagonal elements, though this solution is not generally the square-root measurement as described in [5]. It 
can be shown that this result is still valid in higher dimensional bases. 

We have also shown that the weights Tr(LXt) of the POM elements are irrelevant to the optimality of the strategy. 
The only restriction on them is that the POM is complete (3) and thus realisable. This means that we never need more 
than D 2 (D is the dimension of the system) possible outcomes and thus D 2 non-zero POM elements to achieve the 
minimum error. In any situation where we could use more than D 2 elements and still achieve the minimum error, the 
minimum error measurement strategy cannot be unique. There will then be an unlimited number of optimal strategies 
differing only in the weights of their elements. This constitutes a state discrimination equivalent of Davies theorem for 
the accessible information [1]. 
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